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We prove a de Finetti theorem for exchangeable sequences of states on test spaces, where a test 
space is a generalization of the sample space of classical probability theory and the Hilbert space of 
quantum theory. The standard classical and quantum de Finetti theorems are obtained as special 
cases. By working in a test space framework, the common features that are responsible for the 
existence of these theorems are elucidated. In addition, the test space framework is general enough 
to imply a de Finetti theorem for classical processes. We conclude by discussing the ways in which 
our assumptions may fail, leading to probabilistic models that do not have a de Finetti theorem. 
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I. INTRODUCTION 

There are many scenarios involving probabilistic rea- 
soning about a large number of systems, where it is im- 
portant that these systems can be regarded as identi- 
cal and independent. Classical parameter estimation and 
quantum tomography, in which a source is calibrated by 
measuring separately a number of systems it has pro- 
duced, provide excellent examples. More generally, much 
of experimental science involves repetition of an experi- 
ment, with conclusions drawn from observed relative fre- 
quencies, where the conclusions are only valid under the 
assumptions of identity and independence of the separate 
trials. A problem that arises, therefore, is what justifies 
these assumptions? De Finetti theorems are designed to 
answer this question. 

De finetti theorems have a fundamental significance 
for the subjective Bayesian interpretation of probabili- 
ties. They enable the subjective Bayesian to explain why 
a rational agent treats a sequence of trials as identical 
and independent, without appealing to a notion of ob- 
jective chance, or to unknown probabilities. In addition 
to this fundamental significance, they are also important 
technical tools. For example, quantum de Finetti theo- 
rems have been applied to various problems in quantum 
information theory, including proofs of the security of 
quantum key distribution [l[ , algorithms for deciding the 
separability of bipartite quantum states 0, Q and global 
optimization of the maximum output purity of quantum 
channels Q. 

In this paper, we provide a de Finetti theorem for 
nonsignalling states on test spaces. A test space is a 
generalization of the sample space of classical proba- 
bility theory and the Hilbert space of quantum the- 



ory. The de Finetti theorem for test spaces includes 
the classical and quantum theorems as special cases. 
It also implies a de Finetti theorem for classical pro- 
cesses, which can also be viewed as a theorem about 
states in theories that exhibit "superquantum" correla- 
tionSj_as have been studied recently in quantum informa- 
tion |, i 0, 1, i, Eft EH, Ei Ei EI EE M E3 ■ 

The remainder of this paper is structured as follows. 
In STn) the classical and quantum de Finetti theorems 
are reviewed and their relevance for Bayesian statistics is 
discussed. In mill the test space framework is introduced 
and its connection to the convex sets framework used in 
[H E! is explained. £[V] states the de Finetti theorem 
for test spaces and $V] outlines some of its consequences, 
including the classical and quantum theorems, and the 
theorem for classical processes. WII discusses the role 
of the various assumptions of the test space framework 
and outlines some more general scenarios in which the 
theorem fails. Will concludes. 



II. 



THE CLASSICAL AND QUANTUM DE 
FINETTI THEOREMS. 



De Finetti introduced his theorem in the context of 
a subjective Bayesian approach to probability theory 
[20l I2H . In this approach, probabilities are not defined 
as limiting relative frequencies, nor as objective proper- 
ties of the physical world. Instead they are measures 
of the degrees of belief of a decision making agent. An 
immediate question is, why should degrees of belief be 
represented by real numbers obeying the Kolmogorov 
axioms? De Finetti's answer is provided by his famous 
Dutch Book argument j2(| [2l[ 1 . But a second question 
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is, why do the usual rules of statistical inference apply 
to these quantities? In particular, how and why should 
relative frequencies be used to update probability assign- 
ments? 

Consider an experiment in which a trial with d pos- 
sible outcomes is repeated n times. Under a standard 
sort of analysis, the trials are first judged to be inde- 
pendent and identically distributed, so that the proba- 
bility of getting the outcome sequence Xi, . . . ,X n is given 
by P n (x\, . . . , x n ) — p{xi) x • • • x p(x n ), for some "un- 
known" probability distribution p. The distribution p is 
a parameter to be estimated. 

To the subjective Bayesian, however, this is problem- 
atic since no sense can be given to an "unknown" prob- 
ability. De Finetti provides an alternative analysis. If 
the trial can in principle be repeated an arbitrary num- 
ber of times, then the joint distribution over outcome 
sequences P" should be defined for any n, so consider 
an infinite sequence of distributions P , P 2 , . . . Suppose 
that for the first n trials, the agent is indifferent as to 
whether any further trials are actually performed or not, 
and that the agent is also indifferent as to the order in 
which the outcomes are reported. This suggests that the 
sequence P 1 , P 2 , . . . should satisfy the following. 

Definition 1. P n is symmetric if and only if it is in- 
variant under permutations of the n tests. That is, 
P n (xi, . . . , x n ) = P n (x 7T (i), ■ ■ ■ , %n(n)) f or all permuta- 
tions it : {1, 2, . . . , n} — > {1, 2, . . . , n}. 

Definition 2. The sequence P 1 , P 2 , . . . is exchangeable 
if and only if 

1. \/n, P n is symmetric, 

2. Vrc, Vxi, . . . ,x n , 

P {x\ , . . . , Xn) — SaJ„ +1 =l P ^ (""1 > • • • ) x n+l ) • 

Theorem 1 (de Finetti's representation theorem (2~il . 
l24j]). If the sequence P 1 , P 2 , . . . is exchangeable, then P n 
can be written in the form 



P n [x x 



i %n ) 



d/x(p) p(xi)---p(x n ), (1) 



where Ad is the set of all probability distributions over 
the outcomes {1, ...,d}, \i is a probability measure on 
Ad, \i is independent of n, and /i is unique. 

This is the classical de Finetti theorem for infinite se- 
quences and a finite number of outcomes. It shows that if 
the sequence P 1 , P 2 , . . . is exchangeable, then P n can be 
written as if it were generated via a probability distribu- 
tion p, over unknown probabilities p. In particular, if the 
first m < n trials are performed, and standard Bayesian 
updating applied directly to the joint distribution P", 



one finds that the posterior probability for the remaining 
ri — m trials is given by 



P (^m+l , ■ ■ • , -En |^1 , ■ • 

dn(p\xi, 



) p(x m +l) ■ ■ -p(x n ), (2) 



where fj, is updated as if Bayesian conditioning had been 
performed on an unknown parameter p directly: 



dn(p\x! 



dfijp) x p(xi) x ■ ■ ■ x p(x m ) 
P n (x\ , . . . , x m ) 



The quantum de Finetti theorem is a generalization 
of the classical theorem. It was first presented in 
Refs. [Hii], and a simpler proof given in Refs. [UHl]. 
Whereas the classical theorem concerned the outcome 
probabilities of a test that could be repeated an arbitrar- 
ily large number of times, the quantum theorem concerns 
the joint state of an arbitrarily large number of quantum 
systems. Suppose that each of these systems is associ- 
ated with a (i-dimcnsional Hilbert space Hd- The joint 
state of n systems is then a density operator on the tensor 
product Hilbert space Hf n . Exchangeability is defined 
for a sequence of states ur, lu 2 , . . ., where uj n is a state of 
n systems. 

Definition 3. uj n is symmetric if and only if it is in- 
variant under permutations of the n systems, i.e., 



Tr(Qi 



Q n u> n ) =Tr(Q 7r(1) 



for all permutations it : {1, 2, . . . , n} — ► {1, 2, . . . , n}, and 
for any projection operators Q\, . . . , Q n . 

Note that this is equivalent to requiring that uj n = 
St T oj ti S} t for all permutations it, where is the operator 
that permutes the n systems according to tt. 



Definition 4. The sequence oj 1 ,^ 2 , 
if and only if 

1. Vn, uj n is symmetric, 



is exchangeable 



2. "in, ui n = Tr n+ i(dj™ +1 ) , where Tr„ + i is the partial 
trace over the n + 1st system. 



Theorem 2 (the quantum de Finetti theorem). // the 
sequence u> , u> 2 , ... is exchangeable, then u> n can be writ- 
ten in the form 



uj n = / dfi(uj) UJ I 
Jn 



(3) 



where Q is the set of density operators on Hd, (i is a 
probability measure on Q, [i is independent of n, and fi 
is unique. 

From a fundamental point of view, the quantum de 
Finetti theorem is particularly important for those ap- 
proaches to quantum theory that take a subjective view 
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of the quantum state [29|, [30j, |3jJ, [32, [33J. These ap- 
proaches are closely related to the Bayesian view of prob- 
abilities. A quantum state is taken to represent the de- 
grees of belief of an agent, where these might be beliefs 
about the potential outcomes of measurements. In this 
case, the notion of an unknown quantum state, so preva- 
lent in the literature, becomes problematic. The quan- 
tum de Finetti theorem shows how to dispense with this 
notion, at least in some situations. 

Both of the theorems just presented assume finite sam- 
ple spaces (or finite dimensional Hilbert spaces), and con- 
cern a number of trials or systems that tends to infinity. 
Both have been generalized in a number of ways. Classi- 
cal theorems for a finite number of trials are discussed in 
[li], l35l l3o| and quantum theorems for a finite number of 
systems in 0, [13, [H, HH, H(| . In addition to the quantum 
de Finetti theorem, there is also a de Finetti theorem for 
quantum operations [28l l4lj , for representations of uni- 
tary g roup s [40l |42| and for unitarily invariant quantum 
states 431. 



III. TEST SPACES 

The classical de Finetti theorem involves probabilities 
of outcome sequences for a test that can in principle be 
repeated an arbitrarily large number of times. The quan- 
tum de Finetti theorem involves the joint quantum state 
of a number of quantum systems that can in principle be 
arbitrarily large. Both of these can be viewed as special 
cases of a more general scenario. Suppose that a single 
system is associated with a number of different possible 
tests, which are mutually exclusive in the sense that only 
one can be performed at a time. The classical case is re- 
covered when there is in fact only one such test, and the 
quantum case when the tests correspond to the different 
possible measurements on a quantum system. A state is 
an assignment of probabilities to the outcomes of all the 
different possible tests. 

Assume further that given n systems, a test for each 
system can be independently chosen, and that a joint 
state is an assignment of probabilities to outcome se- 
quences for each possible sequence of tests. Given all 
this, it is possible to define exchangeability for a sequence 
of states and to prove a de Finetti representation theo- 
rem. These ideas are formalized in this section, with the 
representation theorem given in the next. 



A. Single systems 

The technical notion that we use to describe a single 
system is that of a test space. Test spaces were introduced 
with the explicit purpose of describing probabilistic mod- 
els more general than classical and quantum theory, but 
including both as special cases 0, |45| . 




FIG. 1: A Greechie diagram of the test space 
({a, b, c, d, e, /, g}, {{a, b, c, d}, {a, e, g}, {b, e, /}}). 



— o 



o O O— 



FIG. 2: Greechie diagram for classical probability theory over 
a finite set. 



E a set, and S a set of countable subsets of E that covers 
E. 

The idea is that each element of E is a possible out- 
come of a test. Each set s G S corresponds to a possible 
test, with the elements of s being the outcomes of that 
test. The sets in S may overlap, thus the definition of 
a test space is designed to allow for the possibility that 
outcomes of two different tests are identified. The sets 
E and S themselves may have any cardinality, but the 
definition stipulates that the outcomes of any particular 
test are countable. 

For finite E, a test space can be conveniently summa- 
rized by a Greechie diagram (46|. Each element of E is 
represented by a circle, and tests are represented by con- 
necting the corresponding set of circles with a continuous 
line. Examples of Greechie diagrams are given in Figs. 

nsi 

Definition 6. A state on a test space is a map to : E — > 

[0, 1] satisfying ^ ees w(e) = 1 Vs G S. 

A state defines probabilities for the outcomes of each 
test such that (i) these probabilities sum to 1 for each 
test, and (ii) if an outcome appears in more than one 
test it gets the same probability in each case. Given a 
test space A, write the set of all possible states il(A). 
Note that it is easy to construct test spaces for which 
Q(A) is the empty set, or for which a particular outcome 
has probability in all states, or which may for simi- 
lar reasons be judged unsatisfactory. Extra assumptions 
would rule these out, but here there is no need. 

The set of possible state spaces of a test space is 
generic, in the sense that every finite dimensional con- 
vex set arises as the set of states for some test space. 2 



2 This statement extends to infinite dimensions. Precisely: ev- 
ery convex subset of a locally convex topological vector space is 



Definition 5. A test space consists of a pair (E, S), with affinely homeomorphic to the set of all states on some test space. 
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As noted above, discrete classical probability theory is 
recovered when there is only one test, i.e., when A is of 
the form (E, {E}) (see Fig. [2]). It is also possible to re- 
cover classic al p robability theory over an arbitrary mea- 
surable set [481 ] . Quantum theory with projective mea- 
surements is recovered when A = (V(H), M{H)), with 
V(H) the set of projection operators on a Hilbert space 
H and A4(H) the set of projective decompositions of the 
identity. In this case, Gleason's theorem [49| implies that 
each state corresponds to a density operator p, with the 
probability assigned to projector P given by Tr(pP) 3 . 

It will be useful to define arbitrary linear combinations 
of states. Given states uj\,...,uJk € ^(-4), the linear 
combination v — r-jWi, for real r%, is defined as a map 
E — > that satisfies 



v(e) 



riUJi{e) Ve £ E. 



(4) 



The set of all linear combinations of states is a vector 
space denoted V(A). In the case of quantum theory, 
for example, V(.A) is the real vector space of Hermitian 
operators on the Hilbert space. Clearly, fl(A) is a convex 
subset of V(A). Since Q(A) by definition spans V(A), 
they have equal dimension. Importantly, from hereon we 
assume the following. 

Assumption 1. V(A) is finite dimensional. 

Finally, let V*(A) be the vector space dual to V(A), 
that is the set of all linear maps V(A) — > SR. Note 
that each outcome of a test, that is each e £ E, can 
be uniquely identified with a map e £ V*(A) such that 
e(uj) = o;(e) Vcj £ ft(A). Under this identification one 
can write w(e) and e(w) interchangeably, according to 
whether states are viewed as assigning probabilities to 
outcomes of tests, or vice versa. The set E can be viewed 
as a subset of V* (A) , and it is easy to see that the span 
of E is equal to V*(A). 

Before moving on to composite systems, we briefly note 
that there is an alternative approach to operational prob- 



In [471 this was proved for the state space of an orthomodular 
lattice, but the set of all finite ortho-partitions of unity of the 
lattice is a test space that has the same state space. 
3 This holds provided the dimension of the Hilbert space is > 3. 
A generalization of Gleason's theorem to encompass positive op- 
erator valued (POV) measurements does hold for dimension 2 
|50i |51|| , but the formalism of test spaces is not general enough 
to encompass these measurements. This is because a POV de- 
composition of the identity can contain multiple instances of the 
same term, such as {1/2,1/2}, and states are constrained to as- 
sign the same probability to each instance. In the test space 
formalism, different outcomes of the same test are always con- 
sidered distinct. For precisely this reason, generalizations of test 
spaces known as effect- test spaces have been studied that do en- 
compass quantum POV measurements [52t l53l|. However, there 
is no real need to consider them here because we are primarily 
concerned with properties of states, and the set of possible state 
spaces of an effect-test space is no more general than that of a 
test space. 



abilistic theories that has recently been used to investi- 
gate the information theoretic properties of such theo- 
ries [1, [H, HH . In this approach one starts with a com- 
pact convex set f2, to be interpreted as a space of states, 
and defines measurement outcomes to be the set of affinc 
functionals / : Q — ► [0, 1]. The present work could easily 
have been formulated in this framework, but it would be 
odd to do so from a subjective Bayesian point of view. 
If the states are supposed to represent degrees of belief 
then it makes sense to start with the objects that they 
are degrees of belief about, i.e. the tests, rather than the 
states themselves. There is no loss of generality in work- 
ing with test spaces, since the result of [I?} implies that 
any compact convex set can arise as the state space of a 
test space in the finite dimensional case. 



B. Multi-partite systems 

In order to consider multi-partite systems, one needs to 
consider the composition of test spaces and the definition 
of joint states. Suppose that two systems A and B are 
associated with test spaces A — {E,S) and B = (F,T). 
If the combined system is regarded as a system in and of 
itself, then it too should be associated with a test space. 
But how is this constructed, and how is it related to A 
and Bl Nothing that has been said with respect to single 
systems implies a unique answer, so further assumptions 
are needed. 

Suppose that given separate systems A and B, it is 
possible to perform any test s on system A simultane- 
ously with any test t on system B. A joint state assigns 
probabilities to pairs (e, /) of outcomes. In particular, if 
e € s and e £ s', then the probability of obtaining (e, /) 
does not depend on whether the tests performed are s 
and t, or s' and t. Similarly if / 6 t and / £ t'. Suppose 
further that a specification of the joint probability for all 
outcome pairs serves to define the joint state uniquely. 

This motivates the following. 

Definition 7. Given two test spaces, A — (E, S) and 
B = (F,T), the Cartesian product, Ax B, is a new test 
space whose set of outcomes is the set theoretic Carte- 
sian product E x F, and whose set of tests is {s x t \s £ 
S,t £ T}, where sxt is again the set theoretic Cartesian 
product. 

From Definition [6] a state on A x B is a map E x F — > 
[0,1], with probabilities summing to 1 for each pair of 
tests (s,t). 

Definition 8. Consider a bipartite system correspond- 
ing to a test space Ax B. A state lu £ fl(A x B) is 
nonsignalling if and only if 



]T co(e,f) = J2 w(e,/) Ve,M' 



/et 



(5) 



(6) 
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If a state is nonsignalling, then the marginal probabil- 
ity of obtaining outcome e for test s does not depend on 
which B test is performed. This means that a marginal 
state loa G Q(A), can be defined such that 

WA (e)=5>(e,/), (7) 
fet 

where the right hand side does not depend on the choice 
of t. Similarly, one can define a marginal lob G fl(B). 

Definition 9. Given ui A G Cl(A) and lob G ^(B), the 
direct product, lo a ®uj b G £l(A x S), is defined so that 

(<J A ®w B )(e,f)=uj A (e)LU B (f) V(e,f)eExF. (8) 

The definition of the Cartesian product of test spaces 
is valid for any pair of test spaces, including the case in 
which one of them is itself a Cartesian product. Con- 
sidering three test spaces, A = (E,S), B — (F,T), and 
C = (G, U), it is easy to see that (Ax B) xC is isomorphic 
to A x (B x C). Thus one can simply write AxBxC, with 
states on A x B x C identified with maps ExFxG — > [0,1]. 
The product A x A x ■ • • x A, where there are n terms 
in the decomposition, can be written A xn . 

The notion of a nonsignalling state has been defined 
with respect to bipartite decompositions. It extends 
readily to the case of an n-fold product. 

Definition 10. Consider a product of test spaces, A\ x 
■■■ x An- A state ui € fi(«4i x ••• x A n ) is n-fold 
nonsignalling if and only if to is nonsignalling with respect 
to every bipartite split. More formally, let a be a subset 
of {1, . . . , n}, let \a\ — k, and write a — . . . , ik}- 
Then lo is n-fold nonsignalling iff 

e »i e >d e **fc 

= X] ■" X] w(ei ' ■ • • ' e ™)' 

for all a, for all ej with j £ a, and for all tests t il , . . . , t ik 
and t' h , . . . , t' ik . 

Finally, this will be useful: 

Lemma 1. Consider the test space A\ x ■ ■ ■ x A n - The 

direct product states, of the form tO\ ® • • • ® u) n , span a 
subspace of V(Ai x ■ ■ ■ x A n ), and the subspace can be 
identified with V(A\) ® • • • <g) V(A n ). If a joint state lo 
is n-fold nonsignalling, then lo G V(A\) <S> ■ ■ ■ <8> V(A n ), 
i.e., lo can be written as a linear combination of direct 
products. 

Proof. Begin with the n = 2 case. The tensor product 
V(A\)®V(A2) can be defined as the set of bilinear maps 
V*(Ai) x V*(A 2 ) 3i. Any direct product u>i <gi u> 2 
defines such a map via (a, b) — > a(u>i)b(u)2) , and it is 
straightforward that these span V(A\) ® V(A2). Now 
consider a nonsignalling joint state lo. The fact that lo 



is nonsignalling permits the definition of the marginal 
state lo\. Define the conditional state w 2 | e such that 
w 2|e(/) i s the probability of outcome / on system 2, 
given that outcome e was obtained on system 1. Thus 
Lo(e, f) = uji(e)uo 2 \ e {f), for all (e, f) E E x F. Note that 
w 2|e G ^(A 2 ). Now suppose that / and {gi} are elements 
of F such that, considered as elements of V*(A 2 ), f = 
Y,i r i9i- Tllus ^(.f) = Ei r ^2(5i) f or a U w 2 G fl(A 2 ). 
Then uj(e, f) = Wi(e)w 2 | e (/) = Y,i r i u A e ) u i\e{9i) = 
'^2 i rito(e, gi). A value w(e, b) can now be defined for ar- 
bitrary b G T^*(»42) by linear extension. Similar reason- 
ing concludes that Lo(e, b) is linear in the first argument, 
hence can be extended to Lo(a,b) for a G V*(A\). So cj 
defines a bilinear map („4i) x V* (A 2 ) — > 5R as required. 
The extension to general n is straightforward. □ 

Note that if we have two quantum test spaces A = 
(V(H A ),M(H B )) and B = (V(H B ), M{H B )) then the 
space of nonsignalling states on A x B is larger than the 
state space of AB = {V{H A ®H B ), M(H A <E)H B )). To see 
this, recall that the nonsignalling state space of A x B 
only has to be positive for all possible choices of local 
measurements on A and B, whereas the test space AB 
includes joint measurements, such as the Bell measure- 
ment for example. Thus, the criteria to be a state on AB 
are more restrictive than those for nonsignalling states 
on Ax B. Indeed, if we take a state on AB and perform 
a positive, but not completely positive, map on system A 
then the result is still a valid state on Ax B, but not on 
AB in general, e.g. consider performing a partial trans- 
pose on a Bell state. Nevertheless, the state space of 
AB is still a convex subset of the nonsignalling states on 
AxB, which is enough to apply our theorem. More gener- 
ally, one might want to consider rules for composing sub- 
systems that yield a convex subset of the nonsignalling 
states on the Cartesian product for arbitrary test spaces. 

IV. A DE FINETTI THEOREM FOR TEST 
SPACES 

Given a system associated with a test space A = 
(E,S), suppose that n copies are associated with the 
product A xn . Given an infinite sequence of states 
lo x ,lo 2 ,... where io n G il(A xn ), it is possible to define 
symmetry and exchangeability in a manner similar to 
the classical and quantum cases. The main difference is 
that here, the definition of exchangeability involves the 
extra condition that the states are nonsignalling. 

Definition 11. A state lo u G ft (A xn ) is symmetric if 
and only if it is invariant under permutations of the n 
systems. That is, 

w"(ei, . . . , e„) = w n (e 7r(1) , . . . , e 7r( „ ) ), 

for all permutations tt : (1, . . . , n) — > (1, . . . , n). 

Definition 12. A sequence of states lo 1 ,^ 2 ,... where 
Lo n G £1(„4 X "), is exchangeable if and only if 
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1. Vn w™ is symmetric, 

2. Vn u> n is n-fold nonsignalling, 

3. w"(ei, . . . , e„) = E e „ +lS;5 w " +1 ( e i) ■ ■ ■ , e„, e n+i ). 

Theorem 3 (The de Finetti theorem for test spaces). 
Suppose that the sequence to 1 , uj 2 , . . . where u> n € 
D(„4 xn ), is exchangeable. Then u> n can be written in the 
form 



d/u,(w) CO I 



g> u>, 



(9) 



where \x is a probability measure on (1(A), \i is indepen- 
dent of n, and /i is unique. 

Proof. The proof is an adaptation of the proof of the 
quantum de Finetti theorem due to Caves et. al. [13] • 
Recall that in quantum theory, an informationally com- 
plete measurement is a positive operator-valued (POV) 
measurement such that if the outcome probabilities are 
all known, then the state is determined uniquely. The 
strategy of Caves et al. is to generate a classical distri- 
bution by considering an informationally complete POV 
measurement performed separately on each quantum sys- 
tem. Applying the classical de Finetti theorem to the 
distribution of outcome sequences allows the form of the 
quantum state to be inferred. 

In our context, the test space A need not include a 
test that is informationally complete for the state space 
n. But for the purposes of proof, this does not mat- 
ter. All that is needed is a corresponding mathematical 
construction. 

Lemma 2. There exists a set M — {ai, . . . , a^}, where 
di G V*(A) and d is the dimension ofV*(A), such that 

1. M is a basis for V*(A), i.e., the ai are linearly 
independent, 

2. < Oi{u)) <lVwe (1(A), 

s. = 1 VtJ e n ( A )- 

Proof. This result is not new. It is also used in Ref. [HI], 
and we give the same proof. A more general version 
is proven in Ref. [541 ]. Let u be the unique vector in 
V*(A) such that u(u) = 1 for all uj 6 (1(A). Consider an 
arbitrary basis {bi, . . . ,bd} of V*(A). Apply an invert- 
ible linear transformation to obtain {bi, . . . , b c i}, where 
bi = u. Some of the bi may be negative on some 
states uj. Define a constant c as the minimum value of 
bi(uj), where the minimum is taken over all i and all 
lo G (1(A). Then bi — cu is positive and non-zero. It 
follows that ^Api — cu) = (1 — dc)u, with 1 — dc > 0. 
Define ai = (bi — cu)/(l — dc). □ 

The set M will play the role of an informationally com- 
plete measurement. The linear independence of the 
means that a state lo is determined uniquely by the val- 
ues ai(u>). The idea now is that given a nonsignalling 



uj n G f2(„4 xn ), one can at least imagine a measurement 
corresponding to M performed separately on each sys- 
tem. The probability of obtaining an outcome sequence 
(dij , . . . , ai n ) for a direct product state u>\ ® • • • <g> w n 
is defined as (wi) ■ ■ ■ di n (u) n ). Recalling Lemma U 
according to which a nonsignalling state can be writ- 
ten as a linear combination of direct product states, the 
probability of the sequence (a, x , . . . , a,„ ) for an arbitrary 
nonsignalling state can be defined by linear extension. 
Note that this way, the sequence (a^ , . . . , a,„) corre- 
sponds to a vector Oi 1 ®-- > ®Oi n G (V*(A))® n , such that 
the probability is given by (a^ <E> ■ • ■ <S> ai n )(u> n ). The 
vectors <8> • • ■ ® are linearly independent and span 
the tensor product space (V*(A))® n . This means that 
the joint measurement M xn is informationally complete 
for the nonsignalling n-partite system. 

For simplicity, write the probability of the outcome 
sequence (a^ , . . . , di n ) as P n (ii, ■ ■ ■ , i n ) so that 

P n (h,...,i n ) = (a h <g> ■■■®a in )(oj n ). (10) 
If Lo n is symmetric, then so is P n , since 

P n («„(!),..., v ( „)) = (oi w(1) ® •••<g>a i?r(n) )(cj™) 
= (a h ® ■ • ■ ® a in )(n(oj n )) 
= (Ojj ® ••• ® a in )(uj n ) 
= P n (h,...,i n ), (11) 

for any permutation ir. 

Further, if the sequence w 1 ,^ 2 ,... is exchangeable, 
then so is the sequence P 1 , P 2 , . . . , since it is symmet- 
ric and 

J2P n+1 (h,...,i n ,i n +i) 



^2(ah 



< a i n ®ai»+i)(u n+ ) 



ln+1 



= (a i± (g) • • • (g) a in )(co n ) 

= P n (n,...,t n ). (12) 

Now apply the classical de Finetti theorem (Theorem [T]) 
to the sequence P 1 , P 2 , ... to obtain 



P n (ii, . . . ,i n ) = / dfi(p) p(a h ) ■ ■ -p(a in ), (13) 

where is the set of probability distributions over 
{ai, . . . , a d }. 

For each such distribution p, there is a unique ia v G 
V(A) such that ai(oj p ) — p(ai) (where uniqueness follows 
from the linear independence of the a{). Thus Eq. (|13|) 
can be rewritten 



(a n ® ■ ■ ■ ® a ln )(io n ) = \ dfi(p) di^uip) ■ ■ ■ a ln (u p ). 

(14) 

Since the joint measurement is informationally complete 
for the nonsignalling n-partite states, this implies 



jj" = / d[i(p) L0 p g) ■ • ■ g) UJp. 



(15) 
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This is not quite sufficient to establish Theorem [3J 
The right hand side of Eq. (fT5]) is an integral over all 
Lu p G V(A) satisfying < ai(u> p ) < 1 and J2i = 1- 

That is, it is an integral over all lo p that return sensible 
probabilities for the imaginary informationally complete 
measurement. But in general, there are uj p satisfying 
these conditions that are not valid states because they 
return a value < for some element of the test space. 
It remains to show that the integral can be restricted to 
those uj p € Q(A). 

To this end, consider an u p G V(A) such that < 
Oi(wp) < 1 and X^ a i( w p) = 1> but ui p (e) < for some 
e G E. It must be the case that e G s for some test 
s. Let the other elements of s be {/i, . . . , /&} and note 
that 'YlnUp(fi) > 1. There must exist an e > and a 
neighborhood N of ui p in V(A) such that J2i > 1 + e 

for all uj G N. Let N be the subset of such that 
p(di) — a,i(uj) for some u> G N. The expression (jT5j) holds 
for any n, so suppose that n is even and that the test s 
is performed on each of the n systems. The probability 
that outcome e is never obtained is defined by u> n and is 
given by 



o — o- 



-o 



J A (l>^)) 

= [ d/x(p) 

■/A d \AT / 

> (1 + e)" / d/ib), 



(16) 



where we have used the fact that the first term in the 
second line is > if n is even. For large enough n this 
expression is > 1 unless fi(N) = 0. This holds for any 
ujp £ Q(A). It follows that, with a suitable redefinition 
of (l, 



<c" = I dfj,(uj) u <8> • • • <8> u). 

Ifl(A) 



(17) 
□ 



V. CONSEQUENCES OF THEOREM H 

The de Finetti theorem for test spaces is rather gen- 
eral. Very little goes into the definition of a test space 
itself. In fact, in the finite dimensional case we are con- 
sidering, the state space of an individual system may be 
an arbitrary compact convex set. Thus the most sub- 
stantive assumptions that go into the theorem are those 
that concern how test spaces combine when joint systems 
are considered. 

In the classical case, the test space (E, {E}) contains a 
single test and the state space for an individual system is 



(1,1) (2,1) (3,1) 



O O 



-o 



(1,2) (2,2) (3,2) 



O O- 



-o 



(1,3) (2,3) (3,3) 



FIG. 3: The test space for a process with three inputs and 
three outputs. 



Ad, where d is the number of elements of E. In this case, 
the nonsignalling condition is redundant because there 
is just a single test on each system and hence no free- 
dom to choose alternative measurements. The Cartesian 
product of classical test spaces corresponds to the usual 
Cartesian product of sample spaces and so Theorem[3]re- 
duces to Theorem[T]straightforwardly. The quantum case 
is a little more subtle because the state of n quantum sys- 
tems belongs to the state space of (P(H® n ), M{H® n )), 
rather than the state space of the Cartesian product of n 
test spaces of the form (V(H), M(H)). Nevertheless, a 
quantum state defined by a density operator on H® n is 
uniquely specified by the probabilities for measurement 



outcomes of the form Q\ ® Q2 



Q n , where the 



Qi are projection operators. It follows that the states 
on (T{H® n ),M{H® n )) can be identified with a subset 
of the nonsignalling states on (V(H), M.(H)) xn . Hence 
Theorem [3] implies the quantum de Finetti theorem. 

As a further illustration of the generality of the re- 
sult, we note that a theorem for classical processes, or 
conditional probabilities, can also be viewed as a special 
case of Theorem [3l A process C can be thought of as 
taking an input Y into an output A, where Y takes val- 
ues in {1, . . . , k} and X takes values in {1, . . . , d}. The 
process can be defined as the set of conditional probabil- 
ities of the form P(X = x\Y = y) (abbreviated P(x\y)). 
Clearly, the set of all such C can be regarded as the 
set of states on a test space (E,S), where E consists 
of ordered pairs with E = {(x,y)} x =i t „. t d,y=i,...,k and 
S = {{(x, l)}^ = i, ■ • ■ , {(x, fc)}x=i}i as illustrated for the 
d = k = 3 case in Fig. [31 The state space of this test 
space is isomorphic to the set of conditional probability 
distributions via the identification u>((x,y)) = P(x\y). 

More generally, a process C n takes inputs Y±, . . . , Y n 
into outputs Ai, . . . , A„, where each Yi takes values in 
{1, . . . , fc}, and each Aj takes values in {1, . . . , d}. Such a 
C n can be defined as the set of conditional probabilities 
of the form P n (x\, . . . , x n \yi, . . . , y n ), and can also be 
identified with a state on the Cartesian product (E, S) xn . 
The n-fold nonsignalling and symmetry conditions can be 
defined for C n exactly as they are in Definitions [10] and 
ITTI Exchangeability for a sequence C 1 , C 2 , . . . can be 
defined exactly as in Definition [T3J Theorem [3J becomes 
the following theorem for classical processes. 
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Theorem 4. If the sequence C , C 2 , . . . is exchangeable, 
then the conditional probabilities defining C™ can be writ- 
ten in the form 

P(x x , . . . ,x n \y 1 , . . . ,y n ) = / d/j,(p)p(xi\yi)---p(x n \y n ), 

Jn 

(18) 

where fl is the set of processes that take a single input X 
into a single output Y, /i is a probability measure on ft, 
fi is independent of n, and /i is unique. 

A couple of remarks regarding Theorem 2] might be 
helpful. 

First, we described Theorem@]as a special case of The- 
orem[3] It is worth noting the sense in which it is strictly 
less general. After all, any state on any test space could 
be thought of as a classical process, which takes an in- 
put (choice of test) into an output (outcome of test). 
The point is that, with this identification applied to a 
generic test space A, not all sets of conditional proba- 
bilities p{x\y) will correspond to valid states on A. It 
is only if A has the special feature that the tests are 
non-overlapping that this will be the case. The integral 
in Eq.([TS|) ranges over all processes of the form p(x\y), 
whereas in Eq. (J9j) , it is important that the integral ranges 
only over £l(A). 

Second, the dc Finetti theorem for classical processes 
can be viewed as a de Finetti theorem for states in a the- 
ory that is non-classical and non-quantum, but admits 
the most general type of correlations compatible with 
the nonsignalling requirement. This theory is discussed 
in Q, where it is called Generalized Non-signalling The- 
ory. It admits superquantum correlations, which have 
been discussed in the quantum information literature un- 
der the name Popescu-Rohrlich, or nonlocal, boxes. Of 
course, as in the quantum case, exchangeable states do 
not actually exhibit such correlations, since the de Finetti 
theorem shows that they are separable. 

VI. WHEN DOES A DE FINETTI- TYPE 
THEOREM NOT HOLD? 

The de Finetti theorem for test spaces holds thanks 
to a number of assumptions concerning how systems 
combine to make joint systems. One of these is the 
nonsignalling condition for joint states. Others are en- 
coded in the formal definition of a Cartesian product 
of test spaces. It is interesting to see what happens 
when these assumptions are relaxed, so in this section 
we present a number of cases where the theorem fails. 

A. The nonsignalling condition 

First, the assumption that the joint states are 
nonsignalling is crucial not only in the proof, but in the 
very definition of exchangeability. In general, if a state 
u> G ft(A\ x A2) is signalling, then it is not possible to 



define marginal states u>i and u>2- But exchangeability 
requires that, given the n + 1th state in the sequence, 
the marginal state of the first n systems should be de- 
fined and should equal the nth state. In fact, if a state 
u) 6 f2(*4i x A2) is only signalling in one direction then 
one of the marginals can be defined. For example, if prob- 
abilities of outcomes for system 2 depend on which test 
was performed on system 1, but not vice versa, then ui\ 
is well defined. But such a state is not symmetric, thus 
could not form part of an exchangeable sequence. Ar- 
guably, the possibility of performing tests on one system 
that do not affect the other is part and parcel of what 
we mean when we speak of separate systems (or separate 
trials). 



B. Simultaneous measurements 

Implicit in the definition of the Cartesian product of 
test spaces is the idea that a test on one system can 
be regarded as simultaneous with a test on the other 
system. One can certainly imagine rules for combining 
systems where this is not the case. As a very simple 
example, consider two classical bits which have combined 
in the following strange manner. If bit 1 is measured 
before bit 2, then the bits are found to be 00 or 11 with 
equal probability. On the other hand, if bit 2 is measured 
before bit 1, then the outcomes are 01 and 10 with equal 
probability. Note that a suitable no-signalling condition 
is satisfied and it is possible to define marginal states for 
these bits. With more complicated test spaces one can 
construct examples like this which are also symmetric. 4 
We leave open the status of the de Finetti theorem in 
such cases. 



C. Extra degrees of freedom 

Another assumption that is implicit in the Cartesian 
product of test spaces is that the joint state of two sys- 
tems is completely specified by the probabilities for the 
joint outcomes (e, /) of each pair of local tests (s, t). 



4 Consider the test space A, with outcomes E = {a, b, e, d}, and 
two tests corresponding to si = {a, b}, and s2 = {c, d}. Suppose 
that there are two systems, A and B, each described by the test 
space A, and that they have combined as follows. If the test si is 
performed on both systems, the outcomes are completely random 
and uncorrelated. If the test s2 is performed on both systems, 
then the outcomes are completely random and uncorrelated. On 
the other hand, if the test si is performed on either one of the 
systems, followed by s2 on the other, then the joint outcomes 
are ac or bd with equal probability. If the test s2 is performed 
on either one of the systems, followed by si on the other, then 
the joint outcomes are ad or be with equal probability. It is clear 
that this peculiar bipartite system does not allow signalling and 
is also invariant under a permutation of the two systems. 
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One can construct theories in which a joint state does 
indeed determine such probabilities, but is not com- 
pletely specified by them. There are extra degrees of 
freedom, bound up in the two systems, which are in- 
accessible unless some kind of joint operation involving 
both systems at once is performed. 

As discussed in [27j, a clear example of this is pro- 
vided by a modification of quantum theory in which 
real Hilbert spaces are used rather than complex Hilbert 
spaces. States and observables correspond to real sym- 
metric, rather than complex Hermitian, operators. For a 
2-dimensional system (a rebit), there are measurements 
corresponding to x- and z-spin but not y-spin. In this 
case, if a y is the usual Pauli matrix, then 1/4(7 ® I + 
o~ y eg) <7 y ) is an allowed state of two rebits. But the only 
way it can be distinguished from 1/4(7 ® I) is if a joint 
observable such as a y ® a y is measured. Note that in 
real quantum theory, this really is a joint observable: it 
cannot be measured via a separate o~ y measurement on 
each system. 

In [23] it is shown explicitly that the de Finetti theorem 
fails in real quantum theory. If lu" is defined by 

"•'=K^) +K^) <19) 

then it is real and symmetric, and the sequence w 1 , uj 2 , . . . 
is exchangeable. But by the de Finetti theorem for com- 
plex quantum theory, the right hand side of Eq. (|19p is 
the unique de Finetti representation for this sequence. 



states of certain systems, so that there is an effective 
restriction to a convex subset. 

From a foundational point of view, this work can be 
seen as part of a project of understanding what is respon- 
sible for the enhanced information processing power of 
quantum theory, and for the project of deriving quantum 
theory from information theoretic axioms. In particular, 
if one adopts a subjective Bayesian approach to probabil- 
ity, it might be desirable to impose the requirement that, 
in any reasonable theory, one should be able to make 
sense of the idea of reconstructing an unknown state of a 
system by making repeated measurements. Having a de 
Finetti theorem for test spaces means that this does in- 
deed make sense for theories in this framework, and that 
the existing approaches to Bayesian state tomography in 
quantum theory would generalize straightforwardly. 

There are various directions for future work. It would 
be useful to produce a finite de Finetti theorem for test 
spaces. It would also be useful to establish whether a 
(finite or infinite) de Finetti theorem holds without the 
assumption made here of finite dimensionality of state 
spaces. Finally, as discussed in Section IVIB1 it might be 
interesting to explore the status of de Finetti-type the- 
orems in cases where systems combine in non-standard 
ways. 

Note added. Related results have recently been ob- 
tained by M. Christandl and B. Toner (55[, who derive 
a de Finetti theorem for classical processes, analogous to 
Theorem^ of the present work, but extended to the finite 
case. 



VII. CONCLUSION 

In this paper, an infinite de Finetti theorem for test 
spaces has been presented, which generalizes both the 
classical and quantum de Finetti theorems. To illustrate 
the generality of the result, we have shown that a de 
Finetti theorem for classical processes, which may also 
be interpreted as a de Finetti theorem for nonlocal boxes, 
follows as a special case. 

From a practical point of view, proving theorems for 
test spaces, rather than just for quantum theory, confers 
significant advantages. Not only do we achieve a unifi- 
cation of the classical and quantum results, but we also 
obtain results that apply to essentially arbitrary convex 
sets. This is potentially relevant when technological lim- 
itations prevent the preparation of arbitrary quantum 
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